Natural language watermarking
Authors
Abstract
In this paper we discuss natural language watermarking, which uses the structure of sentence constituents in natural language text to insert a watermark. This approach differs from the techniques collectively referred to as “text watermarking,” which embed information by modifying the appearance of text elements such as lines, words, or characters. We provide a survey of the current state of the art in natural language watermarking and introduce terminology, techniques, and tools for text processing. We also examine the parallels and differences between image watermarking and natural language watermarking, and outline how techniques from the image watermarking domain may be applicable to the natural language watermarking domain.
Similar resources
A Natural Language Watermarking Based on Chinese Syntax
A novel text watermarking algorithm is presented. It combines natural language watermarking and Chinese syntax based on BP neural networks. Since the watermarking signals are embedded into Chinese syntactic structures rather than the appearance of text elements, the algorithm is based entirely on content, which can prove to be very resilient. It will play an important role in protecting th...
Natural language watermarking: Challenges in building a practical system
This paper gives an overview of the research and implementation challenges we encountered in building an end-to-end natural language processing based watermarking system. By natural language watermarking, we mean embedding the watermark into a text document, using the natural language components as the carrier, in such a way that the modifications are imperceptible to the readers and the embed...
Natural Language Watermarking Foundations for Individual Marking of Text Data
This paper discusses natural language watermarking, which analyzes patterns inside the sentences of a given natural language text document in order to embed individual watermark messages. The term “natural language watermarking” stands for the process of embedding watermark messages into a text document, using natural language components as the carrier, in such a way that the modifications ar...
Natural language watermarking via morphosyntactic alterations
We develop a morphosyntax-based natural language watermarking scheme. In this scheme, a text is first transformed into a syntactic tree diagram where the hierarchies and the functional dependencies are made explicit. The watermarking software then operates on the sentences in syntax tree format and executes binary changes under the control of WordNet and a dictionary to avoid semantic drops. A certai...
Syntactic tools for text watermarking
This paper explores morphosyntactic tools for text watermarking and develops a syntax-based natural language watermarking scheme. Turkish, an agglutinative language, provides good ground for syntax-based natural language watermarking with its relatively free word order and rich repertoire of morphosyntactic structures. The unmarked text is first transformed into a syntac...